Support QDQ format for weight-only quantization #35

Merged 17 commits on Sep 23, 2024

mengniwang95
Contributor

Type of Change

feature

Description

Support QDQ format for weight-only quantization

It requires:

  • onnxruntime >= 1.19.0
  • opset_version of the model >= 21
  • quantization bits in [4, 8]
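
The three requirements above can be expressed as a small pre-flight check. The following is a minimal sketch, not code from this PR: the function names `parse_version` and `check_qdq_woq_requirements` are hypothetical, and it assumes that `[4, 8]` is a list of allowed bit widths (4 or 8) rather than an inclusive range.

```python
import re

# Thresholds taken from the PR description.
MIN_ORT_VERSION = (1, 19, 0)
MIN_OPSET = 21
# Assumption: "[4, 8]" is read as the allowed values 4 and 8, not the range 4..8.
SUPPORTED_BITS = (4, 8)


def parse_version(version: str) -> tuple:
    """Parse the leading 'major.minor[.patch]' digits; a missing patch counts as 0."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) if part else 0 for part in match.groups())


def check_qdq_woq_requirements(ort_version: str, opset_version: int, bits: int) -> list:
    """Return a list of human-readable problems; an empty list means all checks pass."""
    problems = []
    if parse_version(ort_version) < MIN_ORT_VERSION:
        problems.append(f"onnxruntime {ort_version} is older than 1.19.0")
    if opset_version < MIN_OPSET:
        problems.append(f"model opset {opset_version} is lower than {MIN_OPSET}")
    if bits not in SUPPORTED_BITS:
        problems.append(f"bits={bits} is not one of {SUPPORTED_BITS}")
    return problems


if __name__ == "__main__":
    # Hypothetical inputs for illustration only.
    for problem in check_qdq_woq_requirements("1.18.1", 20, 3):
        print("requirement not met:", problem)
```

In practice the inputs would likely come from `onnxruntime.__version__` and from the default-domain entry of the model's `opset_import`; the sketch takes plain values so it runs without either library installed.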

Expected Behavior & Potential Risk

the expected behavior that is triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Mengni Wang <[email protected]>
Signed-off-by: Wang, Mengni <[email protected]>

@thuang6 merged commit 05bb58a into main on Sep 23, 2024
14 checks passed